BigQuery for Data Warehousing by Mark Mucchetti
Author: Mark Mucchetti
Language: eng
Format: epub
ISBN: 9781484261866
Publisher: Apress
Aha! Orders over $200 fail at five times the rate of orders below $200. In fact, there is a steady curve as order price increases, leading to extremely high failure rates when orders exceed $500. The team then joins the specific errors that are occurring back into the data and plots them on the same graph.
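To make the bucketing step concrete, here is a minimal Python sketch of computing a failure rate per order-price bucket. It is not the team's actual query: the `ORDERS` records, the bucket edges, and the function names are all invented for illustration, and the rates it produces do not reproduce the figures in the scenario. In practice this would more likely be a grouped query run inside the warehouse.

```python
from collections import defaultdict

# Hypothetical order records: (order_total_usd, succeeded).
# Synthetic data for illustration only; not taken from the book.
ORDERS = [
    (35.00, True), (80.00, True), (120.00, True), (150.00, False),
    (180.00, True), (220.00, False), (260.00, True), (410.00, False),
    (450.00, True), (520.00, False), (640.00, False), (700.00, False),
    (900.00, True),
]

# Upper-exclusive bucket edges in dollars; the last bucket is open-ended.
BUCKET_EDGES = [200, 500]


def bucket_label(total):
    """Return a label such as '<200', '200-500', or '>=500'."""
    lower = 0
    for edge in BUCKET_EDGES:
        if total < edge:
            return f"<{edge}" if lower == 0 else f"{lower}-{edge}"
        lower = edge
    return f">={lower}"


def failure_rate_by_bucket(orders):
    """Map each price bucket to (failed orders / total orders)."""
    totals = defaultdict(int)
    failures = defaultdict(int)
    for total, succeeded in orders:
        label = bucket_label(total)
        totals[label] += 1
        if not succeeded:
            failures[label] += 1
    return {label: failures[label] / totals[label] for label in totals}


if __name__ == "__main__":
    for label, rate in failure_rate_by_bucket(ORDERS).items():
        print(f"{label}: {rate:.0%} of orders failed")
```

Keeping the bucket edges in one list makes it easy to try finer buckets when the curve needs more resolution.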
Figure 12-3. Same graph with error type included
Now it's crystal clear: this elevated error rate is caused by an error saying that the bank declined the card for insufficient funds. As customers try to charge cards for higher and higher dollar amounts, they are more likely to be declined for this reason. It turns out there's no problem at all. The issue is caused by customers trying to exceed their credit limits. Relieved, the team leaves the new event sinks in place, creates some additional alerting to fire if this expectation is violated, and goes off to happy hour.
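One way to picture the "fire if this expectation is violated" alert is as a check that failures on higher-value orders are still mostly insufficient-funds declines. The sketch below is only an assumption about how such a check might look: the error code string, the $200 cutoff used as a default, and the 80% share threshold are hypothetical values, not ones the source specifies.

```python
from collections import Counter

# Hypothetical error code; a real payment processor defines its own codes.
EXPECTED_ERROR = "insufficient_funds"


def expectation_violated(failed_orders, min_total=200.0, min_share=0.8):
    """Return True when failures on large orders are NOT mostly EXPECTED_ERROR.

    failed_orders: iterable of (order_total_usd, error_code) for failed orders.
    min_total:     only orders at or above this amount are considered.
    min_share:     assumed threshold; below it, the alert should fire.
    """
    codes = Counter(code for total, code in failed_orders if total >= min_total)
    considered = sum(codes.values())
    if considered == 0:
        return False  # no large failed orders, so nothing to evaluate
    return codes[EXPECTED_ERROR] / considered < min_share


if __name__ == "__main__":
    # Synthetic failures: most are the expected decline, so no alert fires.
    sample = [
        (250.0, "insufficient_funds"),
        (520.0, "insufficient_funds"),
        (610.0, "insufficient_funds"),
        (300.0, "gateway_timeout"),
        (90.0, "gateway_timeout"),  # ignored: below min_total
    ]
    print("alert" if expectation_violated(sample, min_share=0.7) else "ok")
```

A check like this would run on a schedule against fresh data, so that a new failure mode hiding behind the familiar pattern is noticed quickly.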
It's easy to see how this might have spiraled out of control in a less sophisticated organization. Pulling these datasets manually from multiple systems and consolidating them would lead to hours of work to look at a single static view. An inconsistent data warehouse might have made it prohibitively difficult to marry the error logs and the user account data. And as the team was analyzing increasingly stale data, they would have been unable to defend or explain new cases still arriving from customers.
This is a key insight for any business process. Shortening your feedback loop allows you to surface and react to relevant information as soon as it's generated. Now that the team knows this could be an issue, they can monitor it continuously, and should a cluster of customers report the issue again, they can quickly determine if it has the same root cause or if a new issue has surfaced.
Using this as a template, you can easily imagine constructing more sophisticated scenarios. A live clickstream could be integrated to show how users react to unexpected conditions during payment and used to improve the website experience. An alert could be set on one customer receiving a large number of errors from many different credit cards, indicating potentially fraudulent transactions. You could see if users who experience these errors are likely to succeed at purchasing on another card or at a later time. Or you could just see if users who encounter errors on one order are more or less likely to become repeat purchasers. These are all ways you might fruitfully integrate Cloud Logging with BigQuery (or, generically, application performance monitoring with your data warehouse tool).
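As a rough illustration of the fraud scenario, the sketch below flags customers whose payment errors span many distinct cards. Everything in it is hypothetical: the event shape (a customer ID paired with a card fingerprint), the function name, and the threshold of three cards are assumptions chosen for the example, not values from the source.

```python
from collections import defaultdict


def customers_to_flag(error_events, min_distinct_cards=3):
    """Return sorted customer IDs whose errors involve many different cards.

    error_events: iterable of (customer_id, card_fingerprint) pairs, one per
                  payment error. Both fields are hypothetical stand-ins for
                  whatever the error logs actually record.
    min_distinct_cards: assumed alert threshold.
    """
    cards_by_customer = defaultdict(set)
    for customer_id, card_fingerprint in error_events:
        cards_by_customer[customer_id].add(card_fingerprint)
    return sorted(
        customer_id
        for customer_id, cards in cards_by_customer.items()
        if len(cards) >= min_distinct_cards
    )


if __name__ == "__main__":
    # Synthetic events: "c2" fails on three different cards, "c1" retries one.
    events = [
        ("c1", "card_a"), ("c1", "card_a"), ("c1", "card_a"),
        ("c2", "card_b"), ("c2", "card_c"), ("c2", "card_d"),
        ("c3", "card_e"), ("c3", "card_f"),
    ]
    print(customers_to_flag(events))
```

Counting distinct cards rather than total errors separates a customer retrying a single declined card from one cycling through many cards.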
